Book Reviews: Natural Language Processing: A Paninian Perspective
نویسندگان
چکیده
This book is an elementary introduction to natural language processing, but it is a very unusual one for two reasons. First, its subject languages are those of India; second, although the book sets out and discusses the better-known linguistic grammars that are commonly used in natural language processing, the grammar that the book actually uses is one based on the Astfidhy~y~ (pronotmced 'ashtad'yayee'), humankind's earliest extant grammar. To begin with, some background with regard to this grammar is in order. The Astfidhy~y~ is a grammar of the Sanskrit language, as it was spoken by the descendants of the Indo-Aryan tribe living on the Gangetic plain around the sixth century B.C. The grammar, written or compiled by P~inini, contains a little over 4,000 rules. Sanskrit, like its European cousin Latin, is a language in which the most salient grammatical device is inflection, while the least salient one is word order, and Panini's grammar reflects this. The grammar is designed so that, together with a Sanskrit lexicon and a canonical specification of a situation, it is supposed to generate all and only the correct Sanskrit sentences expressing the situation. The assumption made is that each situation to be expressed can be framed as an action or event (kriyd) with associated factors (h/raka), roughly valences. Once a one-to-one correspondence has been established between the elements of a situation canonically framed, on the one hand, and underived root morphemes from the lexicon, on the other, the grammar operates nondeterministically to yield a set of morphologically well-formed words that together express the situation. Up until the latter half of the twentieth century, Panini's grammar was the most rigorous and comprehensive ever written for any language. Moreover, many, if not most, of the key ideas in modern linguistic theory have their origin in the ideas of his grammar (e.g., sandhi) or were anticipated by it (e.g., the theta criterion). The tradition that the grammar initiated made Sanskrit, until this century, the most thoroughly studied human language. Though the languages of the Indian subcontinent belong to two distinct language families, Indo-European (those descended from Sanskrit) and Dravidian, they have much in common: in particular, they tend to be highly inflected. Hence, the treatment of inflection, and not word order, must play the most important role in the processing of such languages. English is a language in which inflection has only a marginal role, while word order has …
منابع مشابه
Book Reviews Ontology-Based Interpretation of Natural Language
A book aiming to build a bridge between two fields that share the subject of research but do not share the same views necessarily puts itself in a difficult position: The authors have either to strike a fair balance at peril of dissatisfying both sides or nail their colors to the mast and cater mainly to one of two communities. For semantic processing of natural language with either NLP methods...
متن کاملBook Recommendation Using Information Retrieval Methods and Graph Analysis
In this paper, we present our contribution in INEX 2015 Social Book Search Track. This track aims to exploit social information (users reviews, ratings, etc. . . ) from LibraryThing and Amazon collections. We used traditional information retrieval models, namely, InL2 and the Sequential Dependence Model (SDM) and tested their combination. We integrated tools from natural language processing (NL...
متن کاملComptes Rendus / Reviews
The present volume represents an introduction to the highly interdisciplinary field of natural language processing (NLP), viewed, with the eyes of a computer scientist, primarily as a subfield of artificial intelligence. The book adresses both computer scientists, and linguists who are familiar with the fundamentals of logic programming, as well as all those who are interested in the basic comp...
متن کاملBook Reviews: Computer Processing of Natural Language
This is a book about linguistic analysis involving a pair of issues that complement each other. First, there is an emphasis on representations: on grammars for describing and generating the sentences that make up a given language. Second, there is an emphasis on processing and computation: on demonstrating that the grammar for a language has important implications for how it can be processed. M...
متن کاملA Supervised Method for Constructing Sentiment Lexicon in Persian Language
Due to the increasing growth of digital content on the internet and social media, sentiment analysis problem is one of the emerging fields. This problem deals with information extraction and knowledge discovery from textual data using natural language processing has attracted the attention of many researchers. Construction of sentiment lexicon as a valuable language resource is a one of the imp...
متن کامل